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Abstract 

At the ultra high frequency level, the notion of price of an asset is very ambiguous. 
Indeed, many different prices can be defined (last traded price, best bid price, mid 
price,. . . ). Thus, in practice, market participants face the problem of choosing a price 
when implementing their strategies. In this work, we propose a notion of efficient price 
which seems relevant in practice. Furthermore, we provide a statistical methodology 
enabling to estimate this price form the order flow. 

Key words: Efficient price, order flow, response function, market microstructure, Cox 
processes, fractional part of Brownian motion, non parametric estimation, functional limit 
theorems. 



1 Introduction 



1.1 What is the high frequency price ? 

The classical approach of mathematical finance is to consider that the prices of basic prod- 
ucts (future, stock,. . . ) are observed on the market. In particular, their values are used in 
order to price complex derivatives. Since options traders typically rebalance their portfolio 
once or a few times a day, such derivatives pricing problems typically occur at the daily 
scale. 

When working at the ultra high frequency scale, even pricing a basic product, that 
is assigning a price to it, becomes a challenging issue. Indeed, one has access to trades 
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and quotes in the order book so that at a given time, many different notions of price 
can be defined for the same asset: last traded price, best bid price, best ask price, mid 
price, volume weighted average price,. . . This multiplicity of prices is problematic for many 
market participants. For example, market making strategies or brokers optimal execution 
algorithms often require single prices of plain assets as inputs. 

Choosing one definition or another for the price can sometimes lead to very significantly 
different outcomes for the strategies. This is for example the case when the tick value (the 
minimum price increment allowed on the market) is rather large. Indeed, this implies that 
the prices mentioned above differ in a non negligible way. 

In practice, high frequency market participants are not looking for the "fair" economic 
value of the asset. What they need is rather a price whose value at some given time 
summarizes in a suitable way the opinions of market participants at this time. This price 
is called efficient price. Hence, this paper aims at providing a statistical procedure in order 
to estimate this efficient price. 

1.2 Ideas for the estimation strategy 

In this paper, we focus on the case of large tick assets. We define them as assets for which 
the bid-ask spread is almost always equal to one tick. Our goal is then to infer an efficient 
price for this type of asset. Naturally, it is reasonable to assume that the efficient price 
essentially lies inside the bid- ask spread but we wish to say more. 

In order to retrieve the efficient price, our idea is to use the order flow. Indeed, it is 
often said by market participants that "the price is where the volume is not". Therefore, 
we assume that the intensity of arrival of the limit order flow at the best bid or the best 
ask level depends on the distance between the efficient price and the considered level: if this 
distance is large, the intensity should be high and conversely. Thus, we assume the intensity 
can be written as an increasing deterministic function of this distance. This function is 
called the order flow response function. In our approach, a crucial step is to estimate the 
response function in a non parametric way. Then, this functional estimator is used in order 
to retrieve the efficient price. 

Note that it is also possible to use the buy or sell market order flow. In that case, the 
intensity of the flow should be high when the distance is small. Indeed, in this situation, 
market takers are not loosing too much money (with respect to the efficient price) when 
crossing the spread. 

1.3 Organization of the paper 

The paper is organized as follows. The model and the assumptions are described in Section 
[2j Particular properties of the efficient price are given in Section [3] and the main statistical 
procedure is explained in Section [H The theorems about the response function can be 
found in Section [5] and the limiting behavior of the estimator of the efficient price is given 
in Section [6l Finally, the proofs are relegated to Section [7l 
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2 The model 



2.1 Description of the model 

We assume the tick size is equal to one, meaning that the asset can only take integer values. 
Moreover, the efficient price is given by 

P t = P + aW t , te[0,T], 

where (Wj )t>o is a Brownian motion on some filtered probability space (CI, T , (J-~t)t>o, IP) and 
.Po is a Fo-vaeasvLr&ble random variable, independent of (Wt)t>o and uniformly distributed 
on [po,po + 1), with po £ N. Note that such a simple dynamics for the efficient price is 
probably still reasonable at our high frequency scale. 

Let Y t be the fractional part of Pt, denoted by {Pt}, that is: 

Y t = {p t } = p t -yp t \. 

To fix ideas and without loss of generality, we focus in the rest of the paper on the limit 
order flow at the best bid level. We assume that when a limit order is posted at time t at 
the best bid level, its price B t is given by [Pt\ . Therefore at time t, the efficient price is 

P t = B t + Y t . 

We denote by Nt the total number of limit orders posted over [0, t]. In order to translate 
the fact that the intensity of Nt should be an increasing function of Yj, we assume that 
(Nt)t>o is a Cox process (also known as doubly stochastic Poisson process) with arrival 
intensity at time t given by 

nh(Y t ), 

where fx is a positive constant and where the response function h : [0, 1) — > R+ satisfies the 
following assumption: 

Assumption HI: Response function The function h is C 1 on [0, 1), with derivative h! 
for which there exist some positive constants c\ and C2 such that for any x in [0, 1), 

C\ < h (x) < C2- 

Moreover, we have the following identifiability condition: 

l 

h(x)dx = 1. 

Remark that HI implies in particular that h is bounded on [0, 1). The response function 
expresses the influence of the efficient price on the order flow. In particular, the limiting case 
where h is constant corresponds to orders arriving according to a standard Poisson process. 
This simple arrival mechanism is often assumed in practice for technical convenience, see 
for example jij, jHj. However, it is probably not very realistic. 

We assume that we observe the point process (N t ) on [0, T]. To get asymptotic proper- 
ties, we let T tend to infinity. It will be also necessary to assume that fj, = fix depends on 
T. More precisely, we have the following assumption: 
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Assumption H2: Asymptotic setting For some e > 0, as T tends to infinity, 

T 5/2+ 7/i T -»• o. 



Note that up to scaling modifications, we could also consider a setting where T is fixed 
and the tick size is not constant equal to 1 but tends to zero. 



3 Properties of the process Y t 

The intensity of the order flow is fj,Th(Yt). This process (Yt)t>o has several nice properties. 
3.1 Markov property 

First, recall that if U is uniformly distributed on [0, 1] and X is a real-valued random 
variable, which is independent of U then {U + X} is also uniformly distributed on [0, 1]. 
Thus, since Pq is uniformly distributed on [0, 1], we obtain that (Yj) is a stationary Markov 
process. Then, from the properties of the sample paths of the Brownian motion, it is clear 
that (Yt) is Harris recurrent and therefore, for / a bounded positive measurable function, 
almost surely, 

lim - / f(Y s )ds = [ f(s)ds, 
see for example Theorem 3.12 in 0]. 



3.2 Regenerative property 

Beyond the Markov property, the process {Yt) a ls° en j°y s a regenerative property. We define 
the sequence of stopping time (z/ n )neN the following way: uq = 0, v\ = inf {t > : Pt £ N} 
and for n > 2: 

v n = inf{t > v n _ x : P t = P Vn _, ± 1} = inf{t > u n _ x : W t = W Vn _, ± 1/a}. (1) 

The cycles (yi +I y n )o<t<^ n+1 -;/ n are independent and identically distributed for n > 1. Thus 
(Yt) is a regenerative process. Note that v<± — v\ has the same law as 

n = inf{i > Q,\W t \ = 1/a}. 

The Laplace transform of t\ is given by 

E[e" 7Tl ] = l/cosh(( v /27)/<r), 7 > 0. 

From this expression, we get E[n] = 1/cr 2 and E[r?l = 5/(3cr 4 ). Thus for / a bounded 



positive measurable function, by Theorem VI. 3.1 in 
lim — / 

T 



we get that almost surely 

1 r'i i rv2 pti 

lim-/ f(Y s )ds = - T E[/ f(Y t )dt]=a 2 E( f({aW t })dt). 

-^+ooT J E[z^2-^i] l J Pl Vo 
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In particular, this implies that 



a 2 E[ ^ f({aW t })dt) = ! f(s)ds. 
Jo Jo 



Furthermore, by Theorem VI. 3. 2, we get the following convergence in distribution as T — > oo: 
VT^ £ f(Y t )dt-a 2 E[£ f({aW t })dt]) 4 iV(0, <r 2 Var[^]), (2) 
with a centered random variable defined by 

Z f = P f(WW t })dt - na 2 E[ r f({aW t })dt] . 
Jo Jo 

We end this section with two particular cases for the function / which will be useful in 
the following. First, if / = h, using the identifiability condition in Assumption HI, we get 
that almost surely: 

lim i / h(Y t )dt = a 2 E[ f 1 h({aW t })dt] = 1. 

Therefore, 

Z h = H {l {Wt<0} (h(l + aW t )-l)+I {Wt>0} (h(aW t )-l))dt 
Jo 

= / ( 11 {«<0}( /i ( 1 + - 1) + I {u>0} (h{au) - l))L_ 1/aA/a (u)du, 

J-l/a 

where L_i/ a ^/ a is the local time of (Wt)t>o stopped at t\. 

In the same way, for f(x) = hh(x)<t}> t £ [0, Ml - ))) we have 

•l 

kh{s)<t}ds = h" 1 ^). 



So we define Z e (t) = Z? by 

z e(t) = / (\ws<o,h(i-+vW s )<t} + \ws>oM°Ws)<t} - h~ 1 (t))ds 







l/«r 

Wu<0,h(l+au)<i} + \u>0,h((Tu)<i} ~ h (t)) L_ 1 / a l / a (u)du. 

-1/a 

In fact, it will be useful to see (^ e (i))te[o,^(i-)) as a process indexed by t. Thus, we introduce 
the covariance function 

p(tx,t 2 ) = Cov[Z e (t 1 ),Z e (t 2 )]=E[Z e (t 1 )Z e (t 2 )}. (3) 

Note that this function and VarfZ' 1 ] can be computed as multiple integrals on [—1/a, 1/a] 2 . 
Indeed, for (u,v) € [—1/a, 1/a} 2 an explicit expression for the bivariate Laplace transform 
of 

{ L -l/a,l/a{u);L-l/^l/ a {v)) 

is available, see Formula (1.3.18.1) in 0|. Using differentiation, from this expression, one can 
easily derive^ E[L_ 1 / cr l / cr (u)L_ 1 / (Tjl / (T (t;)] which enables to deduce p(t\,t 2 ) and VarfZ' 1 ]. 

x We skip the explicit writing of E[i_ 1 / CTjl / cr (u)X_ 1 / CTjl / CT (w)], the expression being quite heavy. 
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4 Estimating the flow response function and its inverse 



We want to estimate the flow response function. Before estimating h, we need to estimate 
fix- Since 

h(Y t )dt]=~J E[h(Y t )]dt = l, 

it is natural to propose 

#T = ~y (4) 

as an estimator for nx- We have the following proposition, whose proof is given in Section 
O 

Proposition 1 If fix —> oo as T — >■ oo, we have 

y/T(fr-l) ^N(0,a 2 V a r[Z h }). 
flX 

We now explain how the estimator of the flow response function is built. Let kx be a 
known deterministic sequence of positive integers. Then define for j = 1, . . . , kx 



N jT/k T — N (j-l)T/k T _ 

fixT Nx 



Now remark that 9j is approximately equal to 

IfirT/krl 

fixT /kx ? ( N (j-i)T/k T +i/^ T ~ N {j _ 1)T /k T +{i-i)/^ T )- 

Conditional on the path of (it), the variables in the sum are independent and if T/kx is small 
enough, they approximately follow a Poisson law with parameter fixh{Yij_i\/j tT )/kx- There- 
fore, if moreover fixT/kx is sufficiently large, one can expect that 8j is close to h(Y^_ 1 ^ T /k T )- 
In fact, throughout the paper, we assume that kx is chosen so that for some p > 0, as T 
tends to infinity, 

TP+ 1 / 2 kxT 1 / 2 
—j^^O, -± >0. 

k T ' 2 

Note that since T^/ 2+£ / fix — > such a sequence kx exists. 

The 6j introduced above are kx estimators of quantities of the form h(uj), Uj £ [0,1]. 
However, we do not have access to the values of the U{. Nevertheless, we know that they 
are uniformly distributed on [0, 1]. We therefore rank the 9j: 6^ < 9(2) < • • • < @(k T )- ^or 
u £ [0, 1), we define the estimator of h(u) the following way: 

h(u) = %«fc T J)- 

Note that the identifiability condition holds since 

,1 -i k T 
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Then, the estimator of h 1 is naturally defined by the right continuous generalized 
inverse of h: 

1 3=1 

5 Limit theorems for the response function 

We now give the theorems associated to h ^ and h. For b > 0, we write D[0,b) (resp. 
D[0,b]), for the space of cadlag functions from [0,6) (resp. [0,6]) into R. We define the 
convergence in law in D[0, b) as the convergence in law of the restrictions of the stochastic 
processes to any compact [0, a], < a < b, for the Skorohod topology J\. We have the 
following result for h : 

Theorem 1 Under HI and H2, as T tends to infinity, we have 



(•) 



Mi") 



T(h-\.) - h-H-)) 4 o-G(-) - ^T^y I °G(v)dv, 

in D[0,h(l~)), where G(-) is a continuous centered Gaussian process with covariance func- 
tion p defined by ©. 

The same type of result holds also for h. 
Theorem 2 Under HI and H2, as T tends to infinity, we have 



_ Mi-) 
T(h{-) - h(-)) A -h'{-)o-G{h{-)) + h{-) I aG(v)dv, 



o 



in D[0,1). 



Note that both in Theorem Q] and Theorem [21 the covariance functions of the limiting 
processes can be computed explicitly, see Section 13.21 



6 Estimation of the efficient price 

Let t € (0, T\. A nice corollary of Theorem Q] is the possibility of estimating Yf, which is the 
fractional part of the efficient price at time t, and therefore the efficient price itself (provided 
t is an order arrival time). Indeed, we have an estimator h of h^ 1 and we know how to 
estimate h(Y t ), namely we 

N t - N f _ T/krr 

h{Y t ) = k T - 



fi T T 



Of course we can take any other time interval of length T/kr containing t. 
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Therefore, it is natural to define an estimator (Y t ) of Y t by 

Y t = h- 1 (MYt)). 

The following corollary shows that this estimator is indeed a good estimator of the efficient 
price Y t . 

Corollary 1 For t > 0, under HI and H2, as T tends to infinity, we have 

VT(Y t -Y t )Ao-G(h(Y t )), 

with G independent of Yt . 

Thus, thanks to this approach it is possible to retrieve the efficient price from the order flow, 
with the usual parametric rate yT. Remark that the law of o-G[h{Yt)) can be explicitly 
computed. Indeed, conditional on Yt we have a centered Gaussian random variable with 
variance o~ 2 p(h(Yt), ■ Then we can use the fact that Yt is independent of G and 

uniformly distributed. 

7 Proofs 

All the proofs are done under HI and H2. In the following, c denotes a constant that may 
vary from line to line, and even in the same line. 

7.1 Proof of Proposition [I] 

We give here an elementary proof of Proposition [TJ However, note that this result can also 
be obtained as a by product of Corollary [2] shown in Section 17.31 

The characteristic function of VT(fiT y Vt — 1) satisfies 

E[exp (itVr(fi T /n T - 1))] = exp(-itVT)E\E Y [exp (it T )] 

where Ey denotes the expectation conditional of the path of (Yt). Since conditional on the 
path of (Yt), Nt follows a Poisson random variable with parameter 

liT I h(Y t )dt, 
Jo 

the characteristic function is equal to 

expHt\/T)E[exp ((e^VT) 1 _ ^ j h (y t )dt 

Now 
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Using the boundedness of h and the fact that px goes to infinity, we get that the limit as T 
tends to infinity of the characteristic function is given by 

rT 



lim E 

T->-+oo 



exp((WT)(i^ h(Y t )dt-l 
From Equation ([2]), this is equal to 

eX p(-^Var[Z ft ]). 



7.2 An auxiliary result 

We now give an auxiliary result from which Theorem [1] and Theorem [2] will be easily deduced. 
For j = 1, . . . , kx, we set 



and define the function h e by 



h e (u) = 9n ukT \) = h(u) — , 
u J) Pt 



for u £ [0, 1). Note that 

Pt N T 



k T i 



p T Tfi T k T ^ 



1 1 f l „ 

— > 0,- = / h e (u)du. 



The right continuous generalized inverse of /i e is given by 



and satisfies 

PT 

We have the following proposition: 
Proposition 2 As T tends to infinity, we have 

d 



T{h-\)-h-\))AoG{.), 
in D[Q,h(l~)). 

By Theorem 15.1 in [2], it is enough to prove that the finite dimensional distributions 
converge and that a tightness criterion holds. The proof is splitted into two Lemmas: 
Lemma Q] for the convergence of the finite dimensional distributions and Lemma [2] for the 
C-tightness. 

Lemma 1 For any n £ N* , for any (t\, . . . , t n ) £ [0, h(l~)) n , as T — > oo, 

^/T{h-\U) - h-Hu))^ ^ A N(0,a 2 u) 
where Uij = p(U,tj), with p the covariance function defined in (J3j) . 
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Proof. We give the proof for n = 1. The other cases are obtained the same way by linear 
combinations of the components of y/Tfh^iti) — h~ l (ti)) 

We write h~ l {t) - h' 1 ^) = T 1 +T 2 + T 3 , with 

^ kr ^ kr 

l k r k T njT/kr 

T2 = T~ Yl htL P T ' k T h(Y )du<t\ ~ T^j hh(Y u )<t}du, 



1 f T 

T 3 =- J l {h{Yu) < t} du-h-\t). 



r-T 
10 

We study each component separately. We have 



7| 



with fj,fc T ,T = or 



- Uj.fcr.T = -1 if 



Oj<t<^ h(Y t )dt, 

1 J(j-l)T/k T 

' h(Y t )dt<t<0j. 

(j-l)T/k T 



T 

For £t > 0, we have E[|tij fc Ti r|] < A\ + ^2, with 

Ai = F[\^ h(Y u )du-t\<e T ], 

1 J(j-l)T/k T 

A *= K [h^T=±l}\ lt ^ J^T^ h{Yt)dt>£T} ] ■ 

We have 

,kr f jT/kT k T f jT/kT 

I V / HY t )dt - h(Y {j _ 1)T/kT )\ < c^- / \Y t - Y {j _ 1)T/kT \dt. 

1 J(j-l)T/k T 1 J(j-l)T/k T 

Therefore, for p > 0, 

'[1^ r' kT h(Y t )dt - h(Y {j _ 1)T/kT )\ > e T ] < P[ sup \W t \ > ce T ] < c^^'' 

1 J(j-l)T/k T ' te[0,T/k T ] E T 

Thus 



A x < P[\h(Y (j _ 1)T/kT ) -t\< 2e T ] + c (r/ y /2 . 
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Using the stationary distribution of ^(j-i)T/fc T we obtain that for et small enough, 
A l < h~\t + 2e T ) - h~\t - 2e T ) + c^%^ < ce T + c ^ 1 ^'' - . 



We now turn to A 2 . Remarking that \vj t k T ,T\ = 1 implies 
h T rjT/k T k r jT/k T 

J J(j-l)T/k T 1 J(j-l)T/ki 



we get 

i, «T/fe T 

A 2 <P[|^--^ / fc(y t )dt| >e T ]. 

1 J(j-l)T/k T 

Recall that conditional on the path of (Y t ), 6j = Zk? / '(utT), where Z is a Poisson random 
variable with parameter 

fjT/hr 
\*>T \ h(Y t )dt. 
J(j~^)T/k T 



Thus we obtain 



Using that 



we finally get 



A s kT 

A 2 < c — — T . 



k T 

n\T 1 \\< v 5>[K fcTiT |], 

E[|\/TTi|] < c£ T VT + c^—^- + c- ? ' 



We take £t = (t/VT, with £t tending to zero. Then 

^p+l/2 fc^Vr 

Thanks to the assumptions on kx, we can find a sequence Ct tending to such that E[|\/TTi|] 
goes to 0. 

We now turn to T 2 . We have 

T2 = t/2 OW r^/ k T h(Y )du<n - l {h(Y s )<t})ds 

1 ~[J(j-l)T/k T 1 t J(j-i)T/k T ^)du<ti 
1 ^ MT/fc 



J'=l 

with Wj,fc T)S ,T = or 



1 ^ 

r <L / w 3,k T ,s,Tds 

1 7^J(j-l)T/k T 
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w j,k T ,s,T = 1 if 



T 

Wj,k T ,t,T = -1 if 



T 



h(Y u )du < t < h(Y s 

(j-l)T/k T 



h„ HT/hr 

h(Y s ) <t<^- h(Y u )du. 

1 J(j-X)T/k T 



Now remark that \wj t k T ,s,T\ = 1 implies 

\h(Y s ) - -± / h{Y u )du\ > \t - ■§ / h{Y u )du\ 

1 J(j-l)T/k T 1 J(j-l)T/k T 



Using the same kind of computations as for T\, for e' T positive and small enough and p > 0, 
this leads to 

r, i , (T/k T ) p/2 

P[K*r,.,T| = 1] < ce' T + c y . 

Then, in the same way as for T\, we easily obtain that E[|\/TT2|] goes to 0. 

Eventually, from ([2]), we get that VTT3 converges in law towards a centered Gaussian 
random variable with variance cr 2 Var[Z e (i)]. 

□ 

We now give the second Lemma needed to prove Proposition [2j 
Lemma 2 Let 

a T (t) = Vr(^^ i {h{Ya) < t} ds-hr\t)). 

The sequence ( a T(^)) 0<t<ft ( 1 -- ) is tight in D[0, h(l~)). 

Proof. Recall first the following classical C-tightness criterion, see Theorem 15.6 in 0|: If 
for p > 0, pi > 1 and all < ti,t 2 < h(l~) 

E[\a T (h) - a T (t 2 )\ p ] <c{\h-t 2 \ pi ), 

then (aT(i)) <t</ l (i) is t[ g ht in D l°> H 1 "))- 

We now use the sequence of stopping time (^ ra ) ngN defined by dU). Let 

nx = inf{i : i>i > T}. 

From Theorem II. 5.1 and Theorem II. 5. 2 in [6J], we have 

EK T ] < cT (6) 

and 

E[(^ - T) 2 ] < cT. (7) 
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We now define 

Yi{hM) = 1 l {t 1<h (Y t )<t 2 }dt - ^{h~\t 2 ) - 

and 

1 r^r 

Y nT+ i(ti,t 2 ) = —j= I{ tl<h( Y t )<t 2 }dt. 

Remark that for i > 2, the Yi(t\,t 2 ) are centered (see Section [3.2p and iid. Moreover, using 
the occupation formula together with a Taylor expansion, we get 

E[(^ 2 (ti,t 2 )) 2 ] <T- 1 e[( r l {tl<h{Yt) < t2] dt) 2 ] < cT~ 1 |t 2 — ti| 2 E[(L*) 2 ], 

with 

L* = sup (i-i/ff.i/ffCt*)). 

ue[-l/er,l/er] 

Prom the Ray-Knight version of Burkholder-Davis-Gundy inequality, see 0] , we know that 
all polynomial moments of L* are finite and thus 

E[(y 2 (ii,i 2 )) 2 ] <cT- l \t 2 -t x \ 2 . 

We show in the same way that 

IE[(^i(ii,t 2 )) 2 ] +E[(F nT+1 (t 1 ,t 2 )) 2 ] ^cT-^-hl 2 . 

We now use the preceding inequalities in order to show that the tightness criterion holds. 
We have 

a T (ti) - a T (t 2 ) = B x + B 2 - B 3 + B 4 , 

with 

tit 

B 1 = Y 1 (t 1 ,t 2 ), B 2 = J2Y i (t 1 ,t 2 ), B 3 = Y nT+1 (t u t 2 ), 

i=2 

5 4 = v / T(^^-l)(/ i - 1 (t 2 )-/ l - 1 (t 1 )). 

Thus 

4 

E[|a T (ti)-aT(t 2 )| 2 ] <c^E0^| 2 ]. 

i=i 

By Theorem 1.5.1 in 

E[|^Fi(t l5 t 2 )| 2 ] <dE[nT]E[(y 2 (ti,t 2 )) 2 ]. 

8=2 

Moreover, using ([6]), we obtain 

E[n T ]E [\Y 2 {t x ,t 2 ) | 2 ] < c|ti — t 2 | 2 - 
Finally, we easily get from ([7]) that 

E[(-B 4 ) 2 ] < c\h -t 2 \ 2 . 

and therefore the tightness criterion is satisfied. □ 
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7.3 Proofs of Theorem [TJ and Theorem [2] 

In order to prove Theorem [TJ and Theorem [21 we start with the following corollary of Propo- 
sition [21 

Corollary 2 AsT^oo, 

■l 



T(V0) - h-\-), h e (.) - h(-), J h e (u)du - l) 
converges in law towards 



a 



G(-),-h'(-)aG{h(-)),- ] oG{v)dv). 



in D[0, h(l )) x D[0, 1) xR (for the product topology). 

Proof. The result follows directly from Proposition [2] together with Theorem 13.7.2 in 
and the continuous mapping theorem. □ 

We now give the proof of Theorem [TJ We write 

VT(h-\-)-h~ 1 (-))=V 1 + V 2 , 

with 

V 1 = y/T[{h-\-pTll>r) ~ h-^.fPr/iPr)), 
V 2 = VT{{h- l {-fi T / l i T )-h-\-)). 

From Proposition [2] together with the continuity of the composition map, see Theorem 13.2.2 
in Q], we have 

in D[0,h(l-)). Now recall that 

/ h e (u)du = fir/HT- 
Jo 

Thus using Corollary [2] together with Theorem 13.3.3 in j^|, we get 



^ -TTTT^TTTV / *G(v)dv, 



h(l~) 



h'{h-H.)) Jo 

in D[0, h(l~)). The convergences of VTVi and \/TV% taking place jointly, this ends the 
proof of Theorem [TJ 

From Theorem [TJ Theorem [2] follows from Theorem 13.7.2 in [jj. 
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7.4 Proof of Corollary [TJ 



Let h e (Yt) be the oracle estimate of h(Yt) defined the same way as h(Yt) but with (it instead 
of fix- Now we use that 

^(■S -«.-(■) 

jlT 

and the fact that the cycles on which h e (Y t ) is defined are negligible in the estimation of 
h~ l (-) and ht- Then, from Proposition [21 we have 

(yT[h-\- - h-H-)),K&)) 4 (aG(.),h(Y t )) 

in D[0,h(l~)) x M. + , with G independent of Yf. Using Theorem 13.2.1 in [8] and the fact 
that 

h(Yt) = ^hAYt), 

we derive 

^(h-\m)) - h-\h^))) A aG{h{Y t )). 

Now recall that 

\h-\hAYt)) -h^{h(Y t )) \ < c\hAYt) - h(Y t )\. 

and that conditional on the path of (Yt), h e {Yt) = Zkr / '(firT), where Z is a Poisson random 
variable with parameter 

i-t 



li T [ h(Y u )d 
Jt-T/kx 

Therefore, using a Taylor expansion, we get that 



E 

is smaller than 



\h e (Y t ) - h(Y t ) 



cE[ sup \Y U - Y t \ 2 ] + c(k T /(Tfi T )) < c(T/k T ) + c{k T / (T fi T )) . 

ue[t-T/k T ,t+T/k T ] 

Eventually, since T/yfkr and kTVT/^T tend to zero, we get 



TE 



h e (Y t ) - h(Y t )\ z < c(T 2 /k T ) + c{k T /nr) 0, 



which concludes the proof. 
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